Abstract

We present a new algorithm for constructing a Chevalley basis for any
Chevalley Lie algebra over a finite field. This is a necessary component for
some constructive recognition algorithms of exceptional quasisimple groups of
Lie type. When applied to a simple Chevalley Lie algebra in characteristic
$p \ge 5$, our algorithm has complexity involving the 7th power of the Lie rank,
which is likely to be close to best possible.

1 Introduction 

Finding a Chevalley basis for a semisimple Lie algebra over $\mathbb{C}$ amounts to
diagonalizing a regular semisimple element: the eigenspaces for non-zero eigenvalues
are just the 1-dimensional root spaces, and suitable eigenvectors can be chosen as
described by Carter [1]. Indeed, the same is true for any Chevalley Lie algebra
over any algebraically closed field. However, over a finite field the problem is much
more difficult. The probability that a random regular semisimple element is split is
approximately the reciprocal of the order of the Weyl group, so something better
than a random search is required if we want a polynomial-time algorithm.
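To get a feel for how small this probability becomes, the following sketch tabulates the standard Weyl group orders by type and rank. The function name `weyl_group_order` is ours and is not taken from the paper; the reciprocal of the returned value is only the rough estimate quoted above, not an exact probability.

```python
from math import factorial

def weyl_group_order(kind, rank):
    """Order of the Weyl group of the irreducible root system of given type and rank."""
    if kind == "A":
        return factorial(rank + 1)
    if kind in ("B", "C"):
        return 2**rank * factorial(rank)
    if kind == "D":
        return 2**(rank - 1) * factorial(rank)
    # Exceptional types: standard values.
    exceptional = {("E", 6): 51840, ("E", 7): 2903040, ("E", 8): 696729600,
                   ("F", 4): 1152, ("G", 2): 12}
    return exceptional[(kind, rank)]

if __name__ == "__main__":
    for kind, rank in [("A", 4), ("B", 4), ("D", 4), ("F", 4), ("E", 8)]:
        order = weyl_group_order(kind, rank)
        print(f"{kind}{rank}: |W| = {order}, rough split probability 1/{order}")
```

Already for $E_8$ the estimate is below one in $10^8$, which is why a naive random search is not practical.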

Let us define a toral subalgebra of a Lie algebra $\mathfrak{l}$ to be any abelian subalgebra
consisting of semisimple elements. If $\mathfrak{t}$ is a maximal toral subalgebra which is split,
then its centralizer in $\mathfrak{l}$ is a Cartan subalgebra $\mathfrak{c}$, and conversely, $\mathfrak{t}$ consists exactly
of the semisimple elements in $\mathfrak{c}$.

Problem 1 Given a split toral subalgebra $\mathfrak{h}_0$ in a Chevalley Lie algebra $\mathfrak{l}$, find a
Cartan subalgebra $\mathfrak{h}$ such that $\mathfrak{h}_0 \subseteq \mathfrak{h}$, and a Chevalley basis with respect to $\mathfrak{h}$.

Problem 2 Given two Cartan subalgebras $\mathfrak{h}_1$, $\mathfrak{h}_2$ of a semisimple Lie algebra $\mathfrak{l}$, find
an element $g \in \mathrm{Aut}(\mathfrak{l})$ such that $\mathfrak{h}_1^g = \mathfrak{h}_2$.



Solutions to these problems are a necessary component for some constructive
recognition algorithms of exceptional quasisimple groups of Lie type [5]. A polynomial-time
Las Vegas algorithm for solving Problem 1 is given by Ryba [7], except in
characteristic 2 (where indeed Problem 1 has no solution in general), and except
for $\mathfrak{a}_2$ and $\mathfrak{g}_2$ in characteristic 3. This algorithm has complexity involving the 11th
power of the Lie rank of the algebra, as well as the fourth power of the logarithm
of the field order, although practical implementations are apparently much faster
than this suggests. He asserts that his algorithm often works in characteristic 2,
but does not attempt a full analysis in that case.

Another algorithm is given by Cohen and Murray [3], with the same exceptions, 
with complexity (in the case when the input is an algebra corresponding to a simple 
algebraic group) involving the 9th power of the Lie rank. (A noteworthy feature 
of their algorithm is that the rate-determining step seems to be checking at each 
stage whether they have finished. It is possible therefore that their algorithm can be 
improved by a more subtle approach to this particular step.) They do not discuss 
the exceptional cases. 

The small characteristic exceptions are discussed by Cohen and Roozemond [4],
but they only consider the problem of finding a Chevalley basis once a Cartan
subalgebra has been found. They do not solve the problem of finding such a subalgebra
in the first place. (This problem is dealt with by Roozemond in [5].) Problem 2
amounts to finding a base-change matrix which maps one Chevalley basis to another,
so is easily reduced to Problem 1, as will be discussed at the end of Section 2.

In this paper we propose a simpler algorithm which has better complexity than 
the above algorithms in the simple case. We achieve this by computing the whole 
Chevalley basis at once, rather than by first computing the Cartan subalgebra. We 
build up the Dynkin diagram one node at a time, making each connected component 
in full before moving on to the next. Our main theorem is as follows. 

Theorem 1 Let $\mathfrak{l}$ be a Chevalley Lie algebra over a field of order $q$ and
characteristic $p \ge 5$. Suppose $\mathfrak{l}$ has Lie rank $\ell$ and dimension $d$. Then there is an algorithm
to compute a Chevalley basis of $\mathfrak{l}$ in $O(\ell d^3 \log q)$ field operations.

2 The main algorithm 

We assume that the characteristic of the field is at least 5. In this case our strategy
is to look for a (long or short root) fundamental $\mathfrak{a}_1$, and find its Chevalley basis
$\{e, f, h\}$. Then we look for another fundamental $\mathfrak{a}_1$ which extends it to a simple
rank 2 algebra (if there is one). Continuing in this way, we build up the connected
component of the Dynkin diagram one node at a time. Then we iterate the procedure
until all components are dealt with. Once all components are completed,
we use the 'extraspecial pairs' as described by Carter [1] to complete the Chevalley
basis for the corresponding simple Lie subalgebras. The algorithm in detail is as
follows. (Comments on 'suitable' choices follow the algorithm.)

1. Input: a Chevalley Lie algebra $\mathfrak{l}_0$ over a finite field of characteristic $p \ge 5$,
and a split toral subalgebra $\mathfrak{h}_0$ (defaulting to zero).

2. Output: a Cartan subalgebra $\mathfrak{h}$ containing $\mathfrak{h}_0$, together with the part of a
Chevalley basis for $\mathfrak{l}_0$ consisting of the $e_\alpha, f_\alpha, h_\alpha$ for simple roots $\alpha$, and a
complete weight space decomposition W of $\mathfrak{l}_0$.

3. Initialise $\mathfrak{h}_1 := 0$. Initialise $\mathfrak{h} := 0$. Initialise $\mathcal{D} := \emptyset$.

4. If $\mathfrak{h}_0 \ne 0$, compute the weight spaces for $\mathfrak{h}_0$, set W equal to the set of
weight spaces, and pair the weight spaces for opposite non-zero weights.

Else pick a random $x \in \mathfrak{l}_0$ and compute the eigenspaces of ad x on $\mathfrak{l}_0$, until
there are some non-trivial eigenspaces with non-zero eigenvalues, and set W
equal to this set of eigenspaces, paired as before. Adjoin to W the perp of W,
so that W spans the whole space.

5. Until W consists of a single subspace which is abelian,

(a) Using the current W, find an $\mathfrak{a}_1$ subalgebra, as follows:

   i. Until there is a pair of opposite 1-dimensional members of W, pick
      a pair of opposite spaces $V^+, V^- \in W$ with $\dim V^+$ minimal, and
      pick random $y \in V^+$ and $z \in V^-$, and let $x = [y, z] \in [V^+, V^-]$, and
      refine the members of W using the eigenspaces of ad x and the perp.
      Recompute the pairing of members of W.
   ii. Pick $e \in V^+$ and $f \in V^-$, and set $h = [e, f]$. Then scale h so
      that $[h, e] = 2e$; then scale f so that $[e, f] = h$. Set $\mathfrak{h}_1 := \langle h \rangle$. Set
      $\mathcal{D} := \{e, f, h\}$.
   iii. Compute the eigenspaces of ad h. Refine W using these eigenspaces.
      Label each element of W by the corresponding eigenvalue of ad h.
      (This label is the first coordinate of what will become the weight
      vector.)

(b) Repeat until a maximal string diagram has been found.

   i. Repeat: pick a suitable label ($\mathfrak{h}_1$-weight) w where the next node of
      the diagram might live, and a pair $V^+, V^-$ of opposite spaces in W,
      with labels $\pm w$, and random $x \in [V^+, V^-]$, and compute eigenspaces
      of ad x on $V^+$ and $V^-$; until ad x has a pair of 1-dimensional eigenspaces
      $\langle e \rangle \subseteq V^+$, $\langle f \rangle \subseteq V^-$ for non-zero eigenvalues $\pm\lambda$.

   ii. Set $h = [e, f]$. Then scale h so that $[h, e] = 2e$; then scale f so that
      $[e, f] = h$.

   iii. Adjoin h to $\mathfrak{h}_1$. Adjoin to $\mathcal{D}$ the vectors e, f, h.

   iv. Compute the eigenspaces of ad h. Refine W using these eigenspaces.
      Append to the label of each element of W the corresponding
      eigenvalue of ad h.

(c) Analyse the string diagram obtained in the previous step, to see whether
or not it is equal to the Dynkin diagram of the current component, using
the data and notation from Table 1.

   i. If the diagram has just two nodes, then for both end nodes, compute
      $\dim V_1$, $\dim V_2$ and $\dim V_3$ to determine both what the diagram is
      and what it should be: the only case where it could be wrong is
      when the diagram is $A_2$ but should be $G_2$.

   ii. Else compute $\dim V_2$ for both end nodes and one interior node.

   iii. If one of these is > 1 and $\ne 7$, then the diagram is $B_n$ or $\tilde{C}_n$: if
      $B_n$, it is correct; if $\tilde{C}_n$, delete an end node to obtain $C_n$.

   iv. If one of these is 7, the diagram is $F_4$ or $B_4$. Distinguish these by
      considering the node adjacent to the short end node. If it is $F_4$,
      the diagram is correct. Otherwise, compute $\dim V_1$ for the short
      end node: if this is 0, the diagram is correct. Otherwise, it is a $B_4$
      diagram but should be $F_4$.

   v. Else all nodes of the diagram are long, and the diagram is $A_n$. Compute
      $\dim V_1$ to determine what the diagram should be. The possible
      cases where the diagram should not be $A_n$ are as follows: $D_{n+1}$
      (any n); n = 2, $G_2$; n = 3, $D_k$ (k > 4); n = 4, $E_8$; n = 5, $E_6$, $E_7$;
      n = 7, $E_7$, $E_8$; n = 8, $E_8$.



Table 1: Dimensions of weight spaces for h in a simple Lie algebra

Type   Root    dim V_1    dim V_2    dim V_3
A_n    -       2(n-1)     1          0
D_n    -       4(n-2)     1          0
E_6    -       20         1          0
E_7    -       32         1          0
E_8    -       56         1          0
B_n    short   0          2n-1       0
B_n    long    4n-6       1          0
C_n    short   4(n-2)     3          0
C_n    long    2(n-1)     1          0
F_4    short   8          7          0
F_4    long    14         1          0
G_2    short   2          1          2
G_2    long    4          1          0

Note: $V_\lambda$ denotes the eigenspace with eigenvalue $\lambda$. Since $\dim V_{-\lambda} = \dim V_\lambda$, we
omit half the eigenvalues. We assume that the characteristic of the field is at least
7: obvious modifications of this table apply in smaller characteristics.

(d) If the type of the diagram is not what we know it should be, adjust it to
be the diagram of the current component, as follows, correcting $\mathfrak{h}_1$, W
and $\mathcal{D}$ as we go:

   i. $A_2$ instead of $G_2$: repeat the above steps until roots of both lengths
      are found.
   ii. $B_4$ instead of $F_4$: remove the long end node, and attach a new (short)
      node at the other end.
   iii. $A_n$ instead of $D_{n+1}$: adjoin a node to the penultimate node.
   iv. $A_3$ instead of $D_n$: attach a tail to the middle node.
   v. $A_m$ instead of $E_n$: attach a tail to a suitable node.

(e) Write out $\mathcal{D}$, all of W which consists of 1-spaces labelled by non-zero
weights, together with these labels.
Adjoin $\mathfrak{h}_1$ to $\mathfrak{h}$.
Remove the part of W which has been written out, and initialise labels
to 0. Initialise $\mathcal{D} := \emptyset$.

(f) If W consists of a single subspace $W_0$, pick a random $x \in W_0$ and compute
the eigenspaces of ad x on $W_0$, until there are some non-trivial eigenspaces
with non-zero eigenvalues, and set W equal to this set of eigenspaces, paired
as before. (If this fails, then $W_0$ is probably abelian, so break.) Adjoin to W
the perp of W, so that W spans the whole space.

6. Now $\mathfrak{h}$ is a subspace of the single element of W, so adjoin to $\mathfrak{h}$ a complement.
Write out $\mathfrak{h}$.
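The repeated instruction "refine W using these eigenspaces" can be illustrated on a toy model. In the sketch below, which is our own simplification and not the paper's implementation, each weight space is modelled as a list of roots of $A_2$, and the eigenvalue of ad $h_\alpha$ on the root space of $\beta$ is taken to be the Cartan integer $\langle \beta, \alpha^\vee \rangle$ reduced modulo p. The real algorithm instead computes eigenspaces of matrices by linear algebra; the names `refine`, `cartan_integer` and `refinement_sizes` are hypothetical.

```python
def cartan_integer(beta, alpha):
    """<beta, alpha^vee> = 2(beta, alpha)/(alpha, alpha) for roots given as integer tuples."""
    num = 2 * sum(b * a for b, a in zip(beta, alpha))
    den = sum(a * a for a in alpha)
    return num // den  # the division is exact for roots of a root system

def refine(spaces, eigenvalue):
    """Split every space in `spaces` according to the value of eigenvalue(item)."""
    refined = []
    for space in spaces:
        groups = {}
        for item in space:
            groups.setdefault(eigenvalue(item), []).append(item)
        refined.extend(groups.values())
    return refined

# Toy data: the six roots of A_2 realised in R^3, and a choice of simple roots.
A2_ROOTS = [(1, -1, 0), (0, 1, -1), (1, 0, -1), (-1, 1, 0), (0, -1, 1), (-1, 0, 1)]
SIMPLE = [(1, -1, 0), (0, 1, -1)]

def refinement_sizes(p=5):
    """Sizes of the modelled weight spaces after refining by each h_alpha in turn."""
    spaces = [list(A2_ROOTS)]
    history = []
    for alpha in SIMPLE:
        spaces = refine(spaces, lambda beta: cartan_integer(beta, alpha) % p)
        history.append(sorted(len(s) for s in spaces))
    return history

if __name__ == "__main__":
    print(refinement_sizes())
```

In this model the first refinement leaves two 2-element spaces, and appending the second label separates all six roots, mirroring how the labels accumulate into weight vectors.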



Comments on the algorithm. In Step 5(ii)(a), we construct the string diagram
by repeatedly trying to attach a node to the previous one. This means looking in
the weight space corresponding to the weight (0, ..., 0, 1) or (0, ..., 0, 2). It is clear
from Table 1 that the former case pertains except in the case $B_n$ at the first step, if
a short root has been found. When this process terminates, we reverse the string
(and the corresponding orderings of $\mathcal{D}$ and $\mathfrak{h}_1$, and the labels on elements of W)
and try again, using the same weights. When this also terminates, a maximal string
subdiagram has been found.
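Several entries of Table 1 can be checked independently by counting roots. The sketch below is our own check, not part of the paper: it builds the classical root systems and $G_2$ in standard coordinates and counts, for a chosen root $\alpha$, the roots $\beta$ with $\langle \beta, \alpha^\vee \rangle = k$ for k = 1, 2, 3. This count equals $\dim V_k$ only when the characteristic is large enough that these integers stay distinct, as the note to the table assumes.

```python
from itertools import combinations, product

def roots(kind, n):
    """Roots of type A_n, B_n, C_n, D_n or G_2 (n ignored for G) as integer tuples."""
    rs = set()
    if kind == "A":
        m = n + 1
        for i in range(m):
            for j in range(m):
                if i != j:
                    v = [0] * m
                    v[i], v[j] = 1, -1
                    rs.add(tuple(v))
    elif kind in ("B", "C", "D"):
        for i, j in combinations(range(n), 2):
            for si, sj in product((1, -1), repeat=2):
                v = [0] * n
                v[i], v[j] = si, sj
                rs.add(tuple(v))
        if kind != "D":
            c = 1 if kind == "B" else 2  # short roots e_i for B, long roots 2e_i for C
            for i in range(n):
                for s in (1, -1):
                    v = [0] * n
                    v[i] = s * c
                    rs.add(tuple(v))
    elif kind == "G":
        for i in range(3):
            for j in range(3):
                if i != j:
                    v = [0] * 3
                    v[i], v[j] = 1, -1
                    rs.add(tuple(v))          # short roots
            for s in (1, -1):
                v = [-s] * 3
                v[i] = 2 * s
                rs.add(tuple(v))              # long roots
    return rs

def weight_dims(rs, alpha):
    """(dim V_1, dim V_2, dim V_3) for h = h_alpha, counted on the root spaces."""
    aa = sum(a * a for a in alpha)
    dims = {1: 0, 2: 0, 3: 0}
    for beta in rs:
        k, rem = divmod(2 * sum(b * a for b, a in zip(beta, alpha)), aa)
        if rem == 0 and k in dims:
            dims[k] += 1
    return dims[1], dims[2], dims[3]

if __name__ == "__main__":
    print("B_3 short:", weight_dims(roots("B", 3), (1, 0, 0)))
    print("G_2 short:", weight_dims(roots("G", 2), (1, -1, 0)))
```

For example, a short root of $B_3$ gives (0, 5, 0), matching the row 0, 2n-1, 0 of the table with n = 3.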

In Step 5(iv)(e) there are various cases, which we now describe in more detail.
In each case the $A_m$ diagram we have found is a maximal $A_m$ subdiagram of the
extended Dynkin diagram of $E_n$. If m = 4 (so n = 8), we attach a tail of length 4
to one of the two interior nodes: a priori, we do not know which, so try both. If
m = 5 and n = 6, attach a node to the middle node. If m = 5 and n = 7, attach a
tail of length 2 to a node adjacent to one of the end nodes: again we do not know
in advance which one, so try both. If m = 7 and n = 7, try to attach a node to
the middle node, after removing either one of the end nodes. If m = 7 and n = 8,
we try the following procedure for each node adjacent to an end node: first remove
the far end node, and then attach a tail of length 2. Finally, if m = n = 8, do the
same for the nodes at distance 2 from each end.

The Chevalley basis. At completion of the main algorithm, we have obtained a 
Cartan subalgebra, and a complete set of root vectors for the fundamental roots and 
their negatives. We also have a set of vectors which are scalar multiples of the other 
root vectors. It remains to complete this to a Chevalley basis of the commutator 
subalgebra by computing the correct scalar multiples of these. 

We assume that for every abstract Dynkin diagram, a choice of structure constants
has been made (see Chapter 4 of [1]). Then we scale each $e_{\alpha+\beta}$ in turn to
ensure that $[e_\alpha, e_\beta]$ is the appropriate multiple $(0, \pm 1, \pm 2, \pm 3)$ of $e_{\alpha+\beta}$. This
requires the characteristic to be at least 5 in the case of a component $G_2$, and at least
3 in the cases of a component $B_n$, $C_n$, $F_4$. In each case, to compute the scalar, it
suffices to compute one non-zero coordinate of $[e_\alpha, e_\beta]$. This can be accomplished
by computing just one column of ad $e_\beta$ and applying it to $e_\alpha$. Once all these scalars
have been computed, we have a complete Chevalley basis for $[\mathfrak{l}_0, \mathfrak{l}_0]$.
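The rescaling just described needs only one non-zero coordinate. A minimal sketch over a prime field follows, assuming vectors are stored as lists of integers modulo p and that the commutator is already known to be a scalar multiple of the unnormalised root vector; the function name `rescale_root_vector` and the sample data are hypothetical.

```python
def rescale_root_vector(commutator, v, n_const, p):
    """Return c*v (mod p) such that commutator == n_const * (c*v) (mod p).

    Assumes `commutator` is a scalar multiple of `v`, that `v` is non-zero,
    and that n_const is invertible mod p (this is where small characteristics fail).
    """
    i = next(k for k, x in enumerate(v) if x % p)        # one non-zero coordinate of v
    s = commutator[i] * pow(v[i], -1, p) % p             # commutator = s * v
    c = s * pow(n_const % p, -1, p) % p                  # want s * v = n_const * c * v
    return [c * x % p for x in v]

if __name__ == "__main__":
    p = 7
    v = [0, 3, 5]                       # unnormalised vector in the root space of alpha + beta
    comm = [4 * x % p for x in v]       # pretend [e_alpha, e_beta] = 4 * v
    print(rescale_root_vector(comm, v, -2, p))
```

The call to `pow(n_const % p, -1, p)` raises an error exactly when the structure constant vanishes modulo p, which reflects the restriction on the characteristic stated above.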

Solution to Problem 2. In the case when $[\mathfrak{l}, \mathfrak{l}] = \mathfrak{l}$ we may use our algorithm
with input $\mathfrak{h}_1$ to produce a Chevalley basis containing a basis of $\mathfrak{h}_1$, and again with
input $\mathfrak{h}_2$. Then any linear map which takes the first basis to the second, preserving
the labelling of the root system, will be an automorphism of the algebra mapping
$\mathfrak{h}_1$ to $\mathfrak{h}_2$, as required.
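The linear-algebra part of this reduction is a single base change. The sketch below, under the assumption that both bases are given as lists of row vectors over a prime field and are listed in matching root-system order, computes the matrix g with `basis1[i] * g == basis2[i]`. It does not itself check that g is an automorphism; that follows from the two bases being Chevalley bases with the same structure constants. The helper names are ours.

```python
def mat_mul(a, b, p):
    """Matrix product modulo p."""
    return [[sum(x * y for x, y in zip(row, col)) % p for col in zip(*b)] for row in a]

def inverse_mod_p(a, p):
    """Inverse of a square matrix modulo a prime p, by Gauss-Jordan elimination."""
    n = len(a)
    m = [[x % p for x in row] + [int(i == j) for j in range(n)]
         for i, row in enumerate(a)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col]), None)
        if pivot is None:
            raise ValueError("matrix is singular modulo p")
        m[col], m[pivot] = m[pivot], m[col]
        inv = pow(m[col][col], -1, p)
        m[col] = [x * inv % p for x in m[col]]
        for r in range(n):
            if r != col and m[r][col]:
                factor = m[r][col]
                m[r] = [(x - factor * y) % p for x, y in zip(m[r], m[col])]
    return [row[n:] for row in m]

def base_change(basis1, basis2, p):
    """Matrix g acting on row vectors with basis1[i] * g == basis2[i] (mod p)."""
    return mat_mul(inverse_mod_p(basis1, p), basis2, p)

if __name__ == "__main__":
    p = 7
    first = [[1, 2], [3, 4]]
    second = [[0, 1], [1, 1]]
    g = base_change(first, second, p)
    print(g, mat_mul(first, g, p) == second)
```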

3 Analysis of the algorithm 

We first analyse the algorithm and its complexity in the case when the input algebra 
is simple and no partial Cartan subalgebra is given. 

Let $\ell$ be the Lie rank, and $d \sim \ell^2$ the dimension of the algebra, and let $q$ be the
order of the field.

Computation of ad x for a random vector x takes $O(d^3)$ field operations. To
compute a pair of eigenspaces for non-zero eigenvalues $\pm\lambda$ (which we do not
compute), we use [5], which takes $O(d^3 \log q)$ field operations. Computing $[x, y]$ also
takes $O(d^3)$ field operations, for example by computing ad y and applying it to x.

At the start of the algorithm (Step 4) we are looking for an element x such
that ad x has a pair $\pm\lambda$ of non-zero eigenvalues. The proportion of such elements
is at least a constant, say 1/3 (see Corollary 6.3 of [3]). Hence this step can be
accomplished in $O(d^3 \log q)$ field operations.

In the simple case the main loop (Step 5) will be traversed only once. In Step
5(i)(a) (and similarly in Step 5(ii)(a)) the commutator $[y, z]$ is in effect a random
matrix of small rank in the centralizer of the part of the Cartan subalgebra that we
have seen. The statistics of this situation are at least as good as the statistics for
a random element. Thus Step 5(i)(a) takes a constant number of $O(d^3 \log q)$ steps.



To justify Step 5(i)(b) we need to show that e and f generate a split $\mathfrak{a}_1$ subalgebra.
This follows from the Jacobi identity for x, e, f and for x, h, e. That is

$$[h, x] = [[e, f], x] = [[e, x], f] + [e, [f, x]] = [\lambda e, f] - [e, \lambda f] = 0,$$

and

$$[[h, e], x] = [[h, x], e] + [h, [e, x]] = 0 + [h, \lambda e],$$

so $[h, e]$ is a $\lambda$-eigenvector of ad x, and so is a scalar multiple of e. Hence, from the
representation theory of $\mathfrak{sl}_2$ we know in particular that ad h is diagonalisable. Thus
Step 5(i)(b) works, and takes a constant number of $O(d^3)$ steps. Moreover, the
eigenvalues of ad h lie in $\{0, \pm 1, \pm 2, \pm 3\}$ so its eigenspaces can also be computed in
$O(d^3)$ field operations.
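Since $[h, e]$ is a scalar multiple of e, the two scalings "scale h so that [h, e] = 2e, then scale f so that [e, f] = h" amount to multiplying both h and f by the same factor. The following sketch carries this out for matrices over a prime field; it is an illustration on 2 x 2 matrices with hypothetical helper names, not the data structures of an actual implementation, which would work with structure constants.

```python
def mat_mul(a, b, p):
    """Matrix product modulo p."""
    return [[sum(x * y for x, y in zip(row, col)) % p for col in zip(*b)] for row in a]

def bracket(a, b, p):
    """Lie bracket [a, b] = ab - ba modulo p."""
    ab, ba = mat_mul(a, b, p), mat_mul(b, a, p)
    return [[(x - y) % p for x, y in zip(r1, r2)] for r1, r2 in zip(ab, ba)]

def scale(a, c, p):
    """Scalar multiple c*a modulo p."""
    return [[c * x % p for x in row] for row in a]

def normalise_triple(e, f, p):
    """Rescale f and h = [e, f] so that [h, e] = 2e and [e, f] = h (mod p)."""
    h0 = bracket(e, f, p)
    he = bracket(h0, e, p)
    n = len(e)
    # Read off c with [h0, e] = c*e from one non-zero entry of e.
    i, j = next((i, j) for i in range(n) for j in range(n) if e[i][j] % p)
    c = he[i][j] * pow(e[i][j], -1, p) % p
    if c == 0:
        raise ValueError("[h, e] = 0, so this pair cannot be normalised")
    t = 2 * pow(c, -1, p) % p          # h = t*h0 gives [h, e] = 2e; f -> t*f gives [e, f] = h
    return e, scale(f, t, p), scale(h0, t, p)

if __name__ == "__main__":
    p = 7
    e, f, h = normalise_triple([[0, 3], [0, 0]], [[0, 0], [4, 0]], p)
    print(e, f, h)
```

The design choice of reading c from a single entry mirrors the remark elsewhere in the paper that one non-zero coordinate suffices to determine a scalar.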

Step 5(ii)(a) is done (at most) once for each fundamental root, and the
computations each time are essentially the same as in Step 5(i). Hence this takes $O(\ell d^3 \log q)$
field operations. Step 5(ii)(b) consists of at most a constant number of eigenspace
computations for known eigenvalues, so takes $O(d^2)$ operations. Step 5(ii)(c) is
similar to 5(ii)(a), and might be done $O(\ell)$ times if we were in the case where we
mistook $D_\ell$ for $A_3$.

Steps 5(v) and 5(vi) are book-keeping and termination so do not take significant 
time. 

The final step of computing the scalars for each weight space for a non-simple
root takes $O(d^2)$ field operations for each root. Thus this computation can be done
in time $O(d^3)$.

Hence the overall complexity in the simple case is

$$O(\ell d^3 \log q) = O(\ell^7 \log q) = O(d^{3.5} \log q)$$

field operations. The proof of Theorem 1 is now complete.
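The passage between the three forms of the bound uses only that d grows like a constant times $\ell^2$. As a quick numerical illustration (our own, with hypothetical function names), the sketch below uses the standard dimension formulas for the simple Lie algebras and reports the ratio $\ell d^3 / \ell^7 = (d/\ell^2)^3$, which stays bounded as the rank grows.

```python
def lie_algebra_dimension(kind, rank):
    """Dimension of the simple Lie algebra of the given type and rank."""
    if kind == "A":
        return rank * (rank + 2)
    if kind in ("B", "C"):
        return rank * (2 * rank + 1)
    if kind == "D":
        return rank * (2 * rank - 1)
    exceptional = {("E", 6): 78, ("E", 7): 133, ("E", 8): 248,
                   ("F", 4): 52, ("G", 2): 14}
    return exceptional[(kind, rank)]

def cost_ratio(kind, rank):
    """(l * d^3) / l^7: the constant hidden when l*d^3 is rewritten as l^7."""
    d = lie_algebra_dimension(kind, rank)
    return (rank * d**3) / rank**7

if __name__ == "__main__":
    for kind, rank in [("A", 10), ("A", 100), ("B", 10), ("B", 100), ("E", 8)]:
        print(kind, rank, lie_algebra_dimension(kind, rank), round(cost_ratio(kind, rank), 4))
```

For type A the ratio tends to 1 and for types B, C and D it tends to 8, so the three expressions differ only by a bounded factor.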

4 Non-simple algebras 

Semisimple case. We have designed our algorithm to apply to the semisimple
case, by ensuring that in Step 5(i) we at least halve the dimension every time we
find a new eigenspace. Hence this step needs to be applied at most $\log d$ times to
find an $\mathfrak{a}_1$ in the first component. Since each application of Step 5(i) or Step 5(ii)
reduces the rank by 1, the overall complexity becomes $O(\ell d^3 \log d \log q)$.

Non-trivial centres. The part of the centre which is generated by commutators 
is part of the output of the algorithm. The rest of the centre plays no role, and we 
can pick an arbitrary basis for it. 

Imperfect algebras. In this case, extra non-central toral elements appear in the 
final step of the algorithm. However, in general it is not possible to scale these 
to any particularly nice form. For example, such an h may act non-trivially on 
multiple components, and it is only possible to scale it to act canonically on one 
component. If the derived subalgebra has large homogeneous components and large 
codimension, this makes the definition of a canonical basis almost impossible. 

In certain cases, however, it is possible to extend our algorithm. For example,
if the derived subalgebra is simple, then there is at most one dimension of
non-central torus outside the derived subalgebra, and we can make a canonical choice
of element. For example, we can demand that $[e_i, h] = \delta_{1i} e_i$, where the $e_i$ correspond
to the fundamental roots.



5 Characteristics 2 and 3 

Characteristic 3. The main problem in small characteristics is that in certain
cases the weight spaces are not 1-dimensional. There may be additional problems for
small fields. In characteristic 3 we only encounter problems with multidimensional
weight spaces in the cases where the Lie algebra has a component of type $\mathfrak{g}_2$, or a
simply-connected component of type $\mathfrak{a}_2$. In both these cases, there are eigenspaces
of dimension 3. Consider first the case $\mathfrak{g}_2$. In this case, the short roots occur in
weight spaces of dimension 1, so these are obtained with high probability in the
same way as above, i.e. by looking for a short root $\mathfrak{a}_2$. Then we need to modify the
algorithm in Step 5(ii)(b) to test whether this $\mathfrak{a}_2$ should be a $\mathfrak{g}_2$: specifically, we
compute the image of ad e for one of the short roots e, and test whether this lies in
the $\mathfrak{a}_2$ algebra. If it does not, then we deduce that the algebra generated by the $\mathfrak{a}_2$
and this image is the full $\mathfrak{g}_2$, so modify Step 5(ii)(c) accordingly, using [3].
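The collapse of weight spaces for $\mathfrak{g}_2$ in characteristic 3 can be seen directly on the root system. The sketch below is our own illustration: it labels each root of $G_2$ by its pair of Cartan integers with respect to a short and a long simple root, reduces the labels modulo p, and counts how many roots share each label. The coordinates and the names `G2_ROOTS`, `SIMPLE` and `weight_multiplicities` are choices made for this example.

```python
from collections import Counter

# G_2 realised in the plane x + y + z = 0 of R^3.
SHORT = [(1, -1, 0), (1, 0, -1), (0, 1, -1)]
LONG = [(2, -1, -1), (-1, 2, -1), (-1, -1, 2)]
G2_ROOTS = [r for v in SHORT + LONG for r in (v, tuple(-x for x in v))]
SIMPLE = [(1, -1, 0), (-2, 1, 1)]   # a short and a long simple root

def cartan_integer(beta, alpha):
    """<beta, alpha^vee> = 2(beta, alpha)/(alpha, alpha), exact for roots."""
    num = 2 * sum(b * a for b, a in zip(beta, alpha))
    den = sum(a * a for a in alpha)
    return num // den

def weight_multiplicities(p):
    """Sorted list of the number of roots sharing each weight label modulo p."""
    labels = Counter(tuple(cartan_integer(beta, alpha) % p for alpha in SIMPLE)
                     for beta in G2_ROOTS)
    return sorted(labels.values())

if __name__ == "__main__":
    print("p = 3:", weight_multiplicities(3))
    print("p = 5:", weight_multiplicities(5))
```

Modulo 3 the six long roots fall into two weight spaces of dimension 3 while the six short roots keep distinct labels, in agreement with the description above; modulo 5 all twelve labels are distinct.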

The simply-connected $\mathfrak{a}_2$ case will only arise at the end, when we have run out
of 1-dimensional eigenspaces, and only 3-dimensional eigenspaces remain. For each
pair of these, we compute the algebra they generate, and find a suitable basis using
[4]. See also [6]. We expect that these modifications will not affect the overall
complexity of our algorithm.

The only other problem in characteristic 3 is in Step 5(ii)(b), where we cannot
distinguish easily between long and short roots in $F_4$ using Table 1. In this case
we may have picked up a $B_4$ root subsystem rather than the whole $F_4$. In order
to detect this, we need to check directly whether all 48 root vectors lie in the
algebra generated by the fundamental root vectors. If not, then we can correct the
fundamental roots in the same way as before.

Characteristic 2. We expect that a combination of our ideas with those of [4]
and [6] will also produce a more efficient algorithm in characteristic 2. First we
briefly sketch how this might work in the simple case $A_n$.

1. Take random x, until we have a 2-dimensional eigenspace of ad x with non-zero
eigenvalue. Pick e, f at random in this eigenspace until $h = [e, f] \ne 0$.

2. Find an eigenspace V of ad h with non-zero eigenvalue, and scale f and h so
that the eigenvalue is 1.

3. Let $V_e = [V, e] \cap V$ and $V_f = [V, f] \cap V$, and pick $y \in V_e$, $z \in V_f$ until
$x := [y, z] \ne 0$.

4. ad x acts on $V_e$ and $V_f$, so intersect the eigenspaces of ad x with $V_e$ and $V_f$.
Similarly for ad(x + h). This gives us enough 1-dimensional spaces to define
the root spaces for an $\mathfrak{a}_2$ subalgebra. Scale the vectors as far as possible.

5. Continue in this way to generate each node of the diagram in turn.

More generally, there is no pairing of weight spaces, and the minimal dimension
eigenspaces which we are aiming for have dimension at most 8 (see [4, Table 1]).
If we modify Step 5(i)(a) by taking $V^+ = V^- \in W$ then we will reach such a
small-dimensional eigenspace in at most $\log d$ steps. If this dimension is not 2 or
4 then the component is of bounded rank, and the methods of [4] suffice. In the
other cases, we can analyse the subalgebra generated by this eigenspace in the same
way as in [4], or as suggested above in the dimension 2 case. We then extend to the
whole component by a modified version of Step 5(ii)(a): we know which eigenspace
$V = V^+ = V^-$ to look in, and if this has dimension 2 we proceed as suggested
in Step 4 of the $A_n$ algorithm above. In the dimension 4 case we again split the
eigenspace according to the actions of the unipotent elements already found.



However, in general in characteristic 2, not every split toral subalgebra is con- 
tained in a split maximal toral subalgebra, and therefore a heuristic algorithm such 
as we suggest may fail to produce a Cartan subalgebra. It may produce a max- 
imal split toral subalgebra which is contained only in a non-split maximal toral 
subalgebra. 